dataset bias
A Win-win Deal: Towards Sparse and Robust Pre-trained Language Models
Despite the remarkable success of pre-trained language models (PLMs), they still face two challenges: First, large-scale PLMs are inefficient in terms of memory footprint and computation. Second, on downstream tasks, PLMs tend to rely on dataset bias and struggle to generalize to out-of-distribution (OOD) data. In response to the efficiency problem, recent studies show that dense PLMs can be replaced with sparse subnetworks without hurting performance. Such subnetworks can be found in three scenarios: 1) fine-tuned PLMs, 2) raw PLMs whose subnetworks are then fine-tuned in isolation, and even 3) PLMs without any parameter fine-tuning. However, these results have only been obtained in the in-distribution (ID) setting.
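Such sparse subnetworks are commonly found with magnitude pruning, as in lottery-ticket-style experiments. Below is a minimal sketch of that idea, assuming a PyTorch model and pruning only its linear layers; the global threshold and the 50% sparsity default are illustrative choices, not the paper's exact procedure.

```python
import torch
import torch.nn as nn

def magnitude_prune(model: nn.Module, sparsity: float = 0.5):
    """Zero out the smallest-magnitude weights across all linear layers.

    Returns binary masks so the subnetwork can be re-applied after
    further training, as in lottery-ticket-style experiments.
    """
    weights = [m.weight for m in model.modules() if isinstance(m, nn.Linear)]
    # Global threshold: the k-th smallest |weight| over all layers pooled.
    all_w = torch.cat([w.detach().abs().flatten() for w in weights])
    k = max(1, int(sparsity * all_w.numel()))
    threshold = all_w.kthvalue(k).values
    masks = []
    with torch.no_grad():
        for w in weights:
            mask = (w.abs() > threshold).float()
            w.mul_(mask)  # prune in place
            masks.append(mask)
    return masks

# Toy usage on a BERT-sized feed-forward block (hypothetical stand-in
# for a full PLM such as one loaded via transformers.AutoModel):
model = nn.Sequential(nn.Linear(768, 3072), nn.GELU(), nn.Linear(3072, 768))
masks = magnitude_prune(model, sparsity=0.5)
```

Whether such a subnetwork also survives OOD evaluation, rather than only the ID setting, is exactly the question the paper above raises.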
Robot Learning in Homes: Improving Generalization and Reducing Dataset Bias
Data-driven approaches to solving robotic tasks have gained a lot of traction in recent years. However, most existing policies are trained on large-scale datasets collected in curated lab settings. If we aim to deploy these models in unstructured visual environments like people's homes, they will be unable to cope with the mismatch in data distribution. In light of this, we present the first systematic effort to collect a large dataset for robotic grasping in homes. First, to scale and parallelize data collection, we built a low-cost mobile manipulator that can be assembled for under 3,000 USD.
- North America > United States > Pennsylvania (0.04)
- Europe > Poland (0.04)
- Asia > Middle East > Jordan (0.04)
- Research Report > Experimental Study (1.00)
- Research Report > New Finding (0.68)
Integrating Object Interaction Self-Attention and GAN-Based Debiasing for Visual Question Answering
Li, Zhifei, Qiu, Feng, Wang, Yiran, Xia, Yujing, Xiao, Kui, Zhang, Miao, Zhang, Yan
Abstract--Visual Question Answering (VQA) presents a unique challenge by requiring models to understand and reason about visual content to answer questions accurately. Existing VQA models often struggle with biases introduced by the training data, leading to over-reliance on superficial patterns and inadequate generalization to diverse questions and images. This paper presents a novel model, IOG-VQA, which integrates Object Interaction Self-Attention and GAN-Based Debiasing to enhance VQA model performance. The self-attention mechanism allows our model to capture complex interactions between objects within an image, providing a more comprehensive understanding of the visual context. Meanwhile, the GAN-based debiasing framework generates unbiased data distributions, helping the model to learn more robust and generalizable features. By leveraging these two components, IOG-VQA effectively combines visual and textual information to address the inherent biases in VQA datasets. Extensive experiments on the VQA-CP v1 and VQA-CP v2 datasets demonstrate that our model outperforms existing methods, particularly in handling biased and imbalanced data distributions, highlighting the importance of addressing both object interactions and dataset biases in advancing VQA tasks. Our code is available at https://github.com/HubuKG/IOG-VQA.

Visual Question Answering (VQA) [1] is an interdisciplinary field that combines the challenges of computer vision and natural language processing to generate accurate answers to questions about images. This task requires a deep understanding of both the visual content and the contextual nuances posed by the questions, making it a complex and demanding research area. Despite significant advancements in recent years, current VQA models often struggle with biases introduced by training data [2], [3], [4], leading to an over-reliance on superficial patterns and correlations rather than genuine visual reasoning and understanding.
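As a rough illustration of the object-interaction component, the sketch below applies single-head self-attention over per-object detector features (e.g. 36 bottom-up-attention regions of dimension 2048, a common VQA setup). The class name, dimensions, and residual form are assumptions for illustration, not the authors' exact IOG-VQA architecture; the GAN-based debiasing branch is omitted.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class ObjectInteractionSelfAttention(nn.Module):
    """Single-head self-attention over per-object region features.

    Input: (batch, num_objects, dim) features from an object detector
    (e.g. Faster R-CNN region features, as in bottom-up-attention VQA).
    Output: object features re-weighted by pairwise object interactions.
    """
    def __init__(self, dim: int = 2048, attn_dim: int = 512):
        super().__init__()
        self.query = nn.Linear(dim, attn_dim)
        self.key = nn.Linear(dim, attn_dim)
        self.value = nn.Linear(dim, dim)
        self.scale = attn_dim ** -0.5

    def forward(self, objs: torch.Tensor) -> torch.Tensor:
        # Pairwise interaction scores between every pair of objects.
        scores = self.query(objs) @ self.key(objs).transpose(-2, -1) * self.scale
        attn = F.softmax(scores, dim=-1)       # (batch, n_obj, n_obj)
        return objs + attn @ self.value(objs)  # residual connection

feats = torch.randn(8, 36, 2048)  # 8 images, 36 detected objects each
out = ObjectInteractionSelfAttention()(feats)
```

The attended features would then be fused with the question encoding; attending over objects rather than grid cells is what lets the model reason about relations such as "the cup on the table".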
A Primer on Causal and Statistical Dataset Biases for Fair and Robust Image Analysis
Machine learning methods often fail when deployed in the real world. Worse still, they fail in high-stakes situations and across socially sensitive lines. These issues have a chilling effect on the adoption of machine learning methods in settings such as medical diagnosis, where they are arguably best-placed to provide benefits if safely deployed. In this primer, we introduce the causal and statistical structures which induce failure in machine learning methods for image analysis. We highlight two previously overlooked problems, which we call the "no fair lunch" problem and the "subgroup separability" problem. We elucidate why today's fair representation learning methods fail to adequately solve them and propose potential paths forward for the field.
- Research Report (0.64)
- Overview (0.42)
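To make the "subgroup separability" notion from the primer above concrete: if a classifier can recover subgroup membership from the inputs alone, a downstream label predictor can exploit that signal as a shortcut. Below is a toy sketch assuming synthetic data and scikit-learn; the feature shift of 1.5 and all distributions are invented for illustration, not taken from the paper.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import roc_auc_score
from sklearn.model_selection import train_test_split

rng = np.random.default_rng(0)
n = 4000

# Synthetic "images": 20 features; group membership (e.g. an acquisition
# site or demographic attribute) shifts the first feature's mean.
group = rng.integers(0, 2, size=n)
X = rng.normal(size=(n, 20))
X[:, 0] += 1.5 * group  # larger shift => more separable subgroups

# If group membership is predictable from the inputs alone, the
# subgroups are separable and shortcut learning becomes possible.
Xtr, Xte, gtr, gte = train_test_split(X, group, random_state=0)
clf = LogisticRegression(max_iter=1000).fit(Xtr, gtr)
auc = roc_auc_score(gte, clf.predict_proba(Xte)[:, 1])
print(f"group-from-input AUC: {auc:.2f}")  # well above 0.5 here
```

An AUC near 0.5 would indicate inseparable subgroups; the further it rises above chance, the more room a biased label predictor has to condition on group membership.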